2009 Digital Image Computing: Techniques and Applications 



Automatic tuning of MST segmentation of mammograms for 
registration and mass detection algorithms 

Mariusz Bajger, Fei Ma, Murk J. Bottema 
Sdiool of Computer Science, Engineering and Mathematics 
Flinders University, Adelaide, SA 5001, Australia 
{mbajger, ma029, murkb}@csem. flinders. edu.au 



Abstract 

A technique utilizing an entropy measure is de- 
veloped for automatically tuning the segmentation of 
screening mammograms by minimum spanning trees 
(MST). The lack of such technique has been a major 
obstacle in previous work to segment mammograms for 
registration and applying mass detection algorithms. 
The proposed method is tested on two sets of mammo- 
grams: a set of 55 mammograms chosen from a publicly 
available Mini-MIAS database, and a set of 37 mam- 
mograms selected from a local database. The method 
performance is evaluated in conjunction with three dif- 
ferent preprocessing filters: gaussian, anisotropic and 
neutrosophic. Results show that the automatic tuning 
has the potential to produce state-of-the art segmenta- 
tion of mass-like objects in mammograms. The neutro- 
sophic filtering provided the best performance. 



1 Introduction 

Anomalies in mammograms associated with cancer, 
especially those associated with early cancer, often re- 
semble normal tissue patterns. This complicates the 
interpretation of mammograms for computer-aided di- 
agnostic (CAD) schemes as well as human readers. 
The static resemblance between normal and diseased 
tissue may be mitigated through bilateral comparison 
or, when available, temporal comparison. Human ob- 
servers are often able to perform such comparisons very 
well, but computer algorithms for this task are prob- 
lematic. Central to CAD comparison of mammograms 
is image registration. In order to decide if there are 
new anomalies, evidence of changes in appearance of 
anomalies, or if patterns between the left and right 
breast are different, algorithms must have the ability 
to associate objects between images. The registration 
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process is increasingly important for the detection of 
pathologies at early stages [10]. 

Two general classes of image registration techniques 
are local matching of pixel intensity values and the use 
of landmarks. Many registration schemes use a combi- 
nation of both these approaches. However, neither of 
these techniques are particularly well suited for regis- 
tration of mammograms. The matching of local pixel 
intensities is compromised by the fact that the same 
object in two views of the same breast may be repre- 
sented by very different sets of intensity values. This 
is due, in turn, to the contortion of breast tissue dur- 
ing image acquisition, differences in exposure and posi- 
tioning as well as normal changes of surrounding breast 
tissue over time. Landmarks in mammograms may in- 
clude the border of the pectoral muscle, the nipple, 
and the breast boundary [10]. The nipple is often dif- 
ficult to detect and the breast boundary is not entirely 
invariant between different images of the same breast 
due to the inconsistencies of image acquisition men- 
tioned above. The pectoral muscle boundary does not 
change substantially over time, is not subject to distor- 
tion at acquisition and is usually easy to delineate near 
the top of medio lateral views. However, the bound- 
ary tends to fade below the middle of the image and 
is absent in cranial caudal views. In addition, these 
landmarks are not generally the objects of interest in 
screening mammograms; at best they serve as interme- 
diates toward registration and subsequent detection or 
characterization of anomalies associated with disease. 
Thus conclusions regarding the disease state of tissue 
requires first the accurate detection of the landmarks, 
then correct registration and finally correct detection 
of the anomalies of interest. 

This paper reports a key step in replacing regis- 
tration of mammograms based on general landmarks 
by registration of the objects of interest themselves. 
Specifically, the goal is to segment mass-like regions in 
the mammogram for the future purpose of studying the 
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similarity and differences between mass-like objects in 
different views (say from different screening visits) of 
the same breast. This context is important for formu- 
lating the exact objective of the segmentation and for 
evaluating the performance. 

Numerous algorithms have appeared for automatic 
detection of masses in screening mammorgrams. Gen- 
erally, the steps are to find the boundary of the 
anomaly as accurately as possible, measure shape, size, 
contrast, and texture features within the anomaly, 
around the anomaly or on the border and classify the 
objects as a mass or non-mass or as a malignant or be- 
nign mass. For these schemes, the boundary must be 
such as to maximally capture differences in the mea- 
sured features inside and outside (or on the border of) 
the anomaly. Normally, an assumption is made that 
such a boundary coincides with the visually apparent 
boundary. 

In contrast, the objective here is to identify objects 
that could possibly be masses and identify the general 
shape, size, contrast, location, ect. only. The exact 
boundary is not needed and is not even desirable since 
the appearance of the object in another view of the 
same breast may not be the same. For this reason, the 
performance of the segmentation should not be mea- 
sured by comparing boundaries of objects with ’’true” 
boundaries drawn by an expert. The ultimate measure 
of segmentation performance is the ability of the final 
scheme to detect breast cancer, but this is an unre- 
alizable criterion during the development stage. The 
approach taken here is to compare segmented regions 
with general information of anomalies provided by an 
expert. Instead of a carefully drawn boundary, the ex- 
pert information consists of a centerpoint indicating 
the location of the anomaly and a circle or a box that 
roughly indicates the size. 

The segmentation method used is a graph-theory 
method based on minimum spanning trees (MST) [2]. 
MST segmentation is attractive in this setting for a 
number of reasons. First, methods based on graphs 
are able to combine both global and local image in- 
formation in the segementation process and, among 
graph based methods, the MST is fairly fast. Second, 
previous work indicates that this method is quite ro- 
bust to shifts, rotations and warping of the type ex- 
pected when comparing sequential screening mammo- 
grams [8]. Third, applying the MST algorithm requires 
choosing only one parameter. This is a distinct advan- 
tage over methods that rely on a larger collection of 
parameters that must be chosen at hoc for each class 
of images or for each individual image. The single pa- 
rameter needed to run MST is denoted by k and con- 
trols the granularity of the resulting segmentation [2], 



[9]. Because k has a predictable effect on the nature 
of the segmentation, it can be used to tune the process 
optimaly. This approach was used in [15] to detect op- 
timal accurate boundaries of the pectoral muscle line 
in mammograms using the image entropy measure in- 
troduced in [18]. 

In the present work, the granularity parameter, k, is 
used to optimize the MST algorithm for the purpose of 
identifying mass-like regions. The motivation for this 
approach comes from the variety of breasts encoun- 
tered in screening mammography. Some breasts appear 
in mammograms as essentially featureless with little 
dense tissue and very few features which could be in- 
terpreted as mass-like. Other images, sometimes called 
’’busy breasts” , comprise a variety of intensity variation 
and many mass-like regions. In each case, better seg- 
mentation results if a good ballance is achieved between 
the use of overall variation of image intensity and local 
variation associated with mass-like regions. This moti- 
vates measuring the criterion for optimal segmentation 
as a balance between overall and local image entropy 
as in [15] and [18]. 

Since MST segmentation is very sensitive to noise, a 
denoising filter has to be used before the segmentation 
process is performed. In what follows this problem is 
addressed in depth by quantifying the results of seg- 
mentation after preprocessing by a gausian filter (most 
often used in the literature for MST segmentation), an 
anistropic filter ([11]), and a very new, neutrosoplric 
denoising method ([4]). 



2 MST segmentation 



The MST segmentation method was proposed in [2] 
and was adapted for mammography in [9]. The key 
structures needed for the present application are as fol- 
lows. 

Let G = (V, E) be an undirected graph where, V, 
the set of vertices, is taken to be the set of pixels 
that form the image and E is a set of edges that con- 
nect neighboring pixels. Elements of E are denoted by 
( Vi,Vj ) where i y and Vj are elements of V. Every pair 
of vertices is assigned a weight, w, by 






\I(vi) ~ I(vj )\ , (vi,Vj)eE, 
oo, otherwise, 



where / (u*) is the image intensity at ty. A compo- 
nent of G is a graph C = (V', E'), where V' C V and 
E 1 C E and with the property that for every pair of 
vertices in V’ there exists a sequence of edges in E' 
such that the catenation of the sequence connects the 
two vertices. A tree that spans a component C and has 
a minimum total weight is called a minimum spanning 
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tree of C (MST(C)). A segmentation of G is a collec- 
tion of components Ck = ( Vk,Ek ) such that UVfc = V 
and Vi D Vj = 0 for * 7 ^ j. 

For a component C € G and minimum spanning tree 
MST(C), the internal difference, Int(C ), is defined as 
the largest weight in MST(C'). For two components C\ 
and C 2 , their difference Diff(C\, C 2 ) is defined as the 
minimum weight connecting C\ and C 2 . 

The MST algorithm builds a new graph H = ( V , F ). 
Initially, F = Fq is the empty set, meaning that each 
vertex (pixel) is a component unto itself. In subsequent 
iterations, components C\ and C 2 are merged, meaning 
that Ci and C 2 form a single component, if 

Dif f(Ci,C 2 ) < 

min (/nt(C'i) + t(Ci), Int(C 2 ) + t(C 2 )) , 

where the threshold function r is given by t(C) = ^ , 
\C\ denotes the number of elements in C, and A: is a 
constant. The process terminates when no new mergers 
are allowed. 

The constant k controls the granularity and hence 
the quality of the segmentation. Even very close val- 
ues of k may produce very different segmentations, as 
the Figure 1 shows. In most applications of the MST 
algorithm, k is fixed empirically for a class of images 
with the result that the granularity is far from optimal 
for some members of the class. In this paper, k is ad- 
justed automatically for each image by optimizing the 
measure of entropy described in the next section. 

3 Entropy of a segmentation 




Figure 1. Granularity of a segmentation, 
(a) The original mammogram (mdb091 from 
Mini-MIAS database) showing the location of 
the mass, (b) The contour zoomed, (c) The 
segmented mass for k = 28. (d) The seg- 
mented mass for k = 29. 



where FlffCj) is the entropy of attribute p for compo- 
nent Cj. In [18], luminance was used as the attribute 
p. In this paper, p is the image intensity value of the 
pixel. Denoting by Mj the set of values associated with 
feature p in component Cj and by Lj(m ) the number 
of pixels in component Cj with value m for feature p, 
the entropy of component Cj is expressed as 



The following function of an image I, was proposed 
in [18] as a measure of effectiveness of an image seg- 
mentation: 

H(I) = H l (I) + H r (I). (1) 

The first term - the layout entropy - measures the global 
image disorder (generally it increases with the number 
of components), and is defined by the formula 



W) 



E 



j = 1 




m ’ 



where |/| is the area of the whole image and Cj is the 
area of the j-tli component. 

The second term - the region entropy - measures 
the uniformity within components (it decreases when 
the number of regions increases), and is given by the 
formula 

Hr(I) = E 

3 = 1 1 1 



MCj) = - E 

m£Mj 



I Cj\ 



log 



L j( m ) 

\Cj\ ' 



4 Neutrosophic image denoising 

The concept of neutrosophic image denoising was 
introduced in [3] and successfully applied to segmen- 
tation of natural and synthetic images in [4]. In [3] 
the neutrosophic filter (NS filter) was proposed for im- 
age denoising and a comparison with the median and 
mean filter was reported. The NS filter outperformed 
the other filters when applied to images with different 
kinds and levels of noise. 

For convenience we briefly describe NS filtering. An 
image P is converted into its neutrosophic form by 
transforming each pixel P{i,j) into its neutrosophic 
equivalent Pvs(bj) = {T(i,j),I(i, j), F(i, j)}, where 
the coordinates indicate probabilities of the pixel mem- 
bership in a region; T% true, /% indeterminate and 
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F% false. The membership values are defined as 



T(i,j) 

F(iJ) 



dii'i 3 ) 9min 
9 max 9 min 

j') $min 
8 max & min 

I g(i,j) I, 

1 -T(i,j), 



where g(i,j) is the intensity value of P(i,j), g(i,j) is 
the local mean value of g(i,j). The local max/min 
values for g and S are calculated for symmetric neigh- 
bourhoods of (i.j) with radius w. 

The a-filtering operation Pns is defined on Pns as 
follows: 



Pns(oz) = P(f(a),I(a),F(a)), 



reduce the computation time the images were subsam- 
pled by a factor of four to 256x256 pixels. Thus the 
final pixel size for the images used in experiments was 
800 /jrn. 

Mammograms from the local database were digi- 
tized at 50 /xi n per pixel and 12 bit depth resolution. 
The images were subsampled by the factor of 8, so that 
the final pixels size was 400 /im. 

A set of 55 mammograms was selected from the 
Mini-MIAS database and another set of 37 mammo- 
grams was taken from the local database. The only 
criterion for selection was that at least one mass-like 
object was present and annotated by a radiologist with 
expertise in mammography. 

6 Method 



where 



6.1 Mass coverage ratio 



T{a) 



T a (i,j) 



Mb j) 

T a (i,j ) 



T, I < a 
T a , I > a ’ 

i-\-w / 2 j -\~w / 2 



l 



W X W 



Y Y T ( m > n )> 



i=i—w/2 n=j—w/2 

Sr(i,j) — St min 

St max — 8 t min 

— T(i,j)\, 

i-\-w / 2 j-\-w/2 



l 



W X w 



Y Y T a (m,n). 



m—i—w/2 n—j—wj 2 



The cc-filtering is performed on the image until the rela- 
tive entropy of the indeterminate set I is smaller than a 
specified threshold. In this study we used a = 0.85 for 
all images, w = 4, for Mini-MIAS images, and w = 6 
for our local database. The entropy thresholds were 
set to 10 -3 and 5 x 10 -6 for Mini-MIAS and the local 
set, respectively. These values were determined empir- 
ically. 

5 Databases 

Two mammographic databases were used to test 
the method: the publicly available and widely used 
Mini-MIAS mammographic database [14], and a local 
database of mammograms. 

The spatial resolution of Mini-MIAS images is 200 
/xm per pixel and the depth resolution is 8 bit. All im- 
ages in the database are of 1024x1024 pixels in size. To 



The ultimate goal of this study is to provide a rig- 
orous method of selecting automatically k value which 
results in a usable segmentation of all mass regions. 
Depending on the application ’usable’ may be as little 
as 40% of the mass area segmented properly, if only 
detection is concerned, or 70% or more if also classifi- 
cation is involved. However, it is critical that no mass- 
like object is missed, so the emphasis is more on being 
able to obtain a minimum coverage of masses than be- 
ing able to segment accurately most mass-like objects 
but totally missing some of them. To meet this goal 
an appropriate measure of the segmentation quality is 
necessary. 

We adopt measure 1Z called the mass coverage ratio 
([7]). 77 was computed for each mass-like region and 
each segmentation. Since an exact representation of 
the mass-like region is not necessary and not even pos- 
sible due to undefined boundaries for masses, the sub- 
sequent processing includes the merging of components 
if, in aggregate, they are more mass-like. Accordingly, 
a suitable criterion for measuring the quality of the seg- 
mentation of mass-like regions is to compute how well 
the union of all reasonable components which overlap 
the annotated region fits the annotation. By a reason- 
able component is meant a component with at least 
half of its area residing within the annotated region. 
Thus a component C is said to overlap the annotated 
region T if 

\cnr\ > |C\T|. 

The mass coverage ratio is then defined by 



\A n Tj 
[A U T| ’ 



(2) 
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where A = U C) and the union is taken over compo- 
nents that overlap T. 

Since the MST segmentation parameter k roughly 
determines the area of the smallest component in terms 
of numbers of pixels that can be retained, and circu- 
lar objects having 3-5mm diameter are most important 
mass candidates the appropriate range of k values can 
be estimated based on the spatial resolution of images. 
For Mini-MIAS mammograms, those objects have esti- 
mated area 27-34 pixels, while for images from our local 
database the area ranges from 110-134 pixels. Hence, 
the reasonable ranges for k were estimated as 27-34, 
for Mini-MIAS mammograms and 110-134, for mam- 
mograms from the local database. 

6.2 Noise reduction 

Three methods for noise reduction were tested for 
use prior to MST segmentation. An anisotropic filter 
(see e.g. [11]), the symmetric Gaussian filter of size 6 
(4 for Mini-MIAS images) with standard deviation a = 
0.8, and the neutrosophic filter described in Section 4. 
Figure 2 shows the range of mass coverage ratio for 
each mass in each database, obtained by varying k, 
after anistropic, gaussian and neutrosophic filtering. 

Regardless of the kind of filter applied, one cannot 
generally expect that there is a single value of k pro- 
ducing high 7Z values for each image. The local set 
of mammograms used in this study is an example of 
a collection where no such value exists. Gaussian and 
neutrosophic smoothing prior to segmentation reduced 
the range of 7Z values substantially while anistropic fil- 
tering reduced the range less. The graphs also show 
that for each image there exist values of k resulting in 
mass coverage ratio value of 40 % or higher. This obser- 
vation motivated the search for an automatic method 
for chosing an optimal value of k (in that it avoids dis- 
astrous mass coverage ratios) for each image. 

6.3 Optimizing MST 

To determine optimal values of the parameter k the 
following algorithm was followed. Every mammogram 
was segmented for the specified range of values of k as 
determined in Sec. 6.1. For each value of fc, the entropy 
of the segmentation was computed using the formula 
in (1). The value of k resulting in the smallest entropy 
segmentation was selected as an optimal one. 

In order to be able to quantify the results, approx- 
imate mass contours were drawn (within the radiolo- 
gist annotated areas) for each image. A single contour 
was used for each Mini-MIAS image (drawn by one of 
the authors), while for the more challenging local set 
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Figure 2. For each image index, the vertical 
line segment indicates the range of 1Z values 
obtained by various choices of k. (a), (c), 
(e) relate to Mini-MIAS images filtered with 
anistropic, gaussian and neutrosophic filters, 
respectively, (b), (d), (f) relate to local images 
filtered with anistropic, gaussian and neutro- 
sophic filters, respectively. 



of images two contours were drawn independently by 
the authors, and both union and intersection of those 
contours were tested. This compensates for subjective 
estimation of boundaries of difficult to draw contours. 

Image J software package ([12]) was used to annotate 
the images by the radiologist and to draw contours. 

7 Results 

Table 1 shows that all Mini-MIAS images were seg- 
mented successfully, that is, with 7 Z at least 40 %, when 
neutrosophic filtering was applied. Moreover, 69 % of 
images were segmented with a very high ratio (at least 
70%). For gaussian filtering the numbers are compa- 
rable while for the anistropic filtering they are signifi- 
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Table 1 . Mass coverage ratio by range for the 
Mini-MIAS set for each of the three filters. 
The first number gives the number of images 
falling into that range for some values of k. 
The corresponding number for k value se- 
lected with our method (the optimal value) is 
shown in brackets. Local database images 
were tested for the union and intersection of 
two contours drawn by the authors according 
to the radiologist annotations. 
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Figure 3. The performance of MST segmenta- 
tion for the three filters on the Mini-MIAS set. 
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Figure 4. The performance of MST segmenta- 
tion for the three filters on the local data set 
for union of contours. 



cantly worse. 

For images from our local database the ratios were 
obtained for both union and intersection of two inde- 
pendently drawn contours by the authors, according 
to the radiologist annotations. These are shown sepa- 
rately in Table 1. In all cases the superiority of neu- 
trosophic filtering is visible. It allowed for about 50 % 
reduction of number of potential failures (those less 
than 40 % segmented). The results for the gaussian fil- 
tering were comparable, while the anistropic filtering 
produced significantly worse outcomes. 

Figures 3, 4 and 5 show plots of the proportion of 
correctly segmented masses as functions of the over- 
lap threshold used to define successful segmentation 
for various versions of segmentation methods discussed. 
The graphs indicate that the performances of MST 
segmentation for gaussian and neutrosophic filters are 
comparable and outperform the segmentation for im- 
ages with anisotropic filtering. 



Looking at the details of ratio values on image-by- 
image base, Figures 7, 6 we also conclude that the 
anistropic filtering is not appropriate for MST segmen- 
tation and the neutrosophic one outperforms both the 
gaussian and anistropic filtering in terms of minimiz- 
ing the number of failures (images with less than 40 % 
segmentation ratios) . 

Comparing Figures 3, 4 and 5 to the one obtained 
in [7] (indicative only since different databases were 
used) we see that MST segmentation is capable of pro- 
ducing radiologist-like outlines. 

8 Discussion 

There were two images (out of 37), in our local 
database set, where (regardless of the filter used) the 
method did not work. That is, in those cases the op- 
timal mass coverage ratio was smaller than 40 % both 
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Figure 5. The performance of MST segmenta- 
tion for the three filters on the local data set 
for intersection of contours. 



for the union and intersection of contours. 

Figure 8 shows the images annotated by a radiol- 
ogist together with their optimal segmentation. It is 
evident from the images that the mass is hardly visible 
in the first case, and is literally invisible in the second. 
In both cases the boundaries are extremely hard to de- 
tect. Very low saliency of the objects resulted in over- 
segmentations for most values of k. Since the entropy 
method favors the segmentation in which a few seg- 
ments dominate (similarly to human perception) (this 
was recently proved rigourously in [17]) it assigned the 
best score to an oversegmented image partition, (shown 
in Figure 8 (c), (d)). 

9 Conclusion and further work 

The study shows that automatic tuning of MST seg- 
mentation by using a measure of entropy to select one 
of several possible segmentations has a potential to de- 
liver useful delineation of mass-like objects in mammo- 
grams for the purpose of CAD systems. It is capable of 
producing very high mass coverage ratio for majority of 
mammograms and, more importantly, producing very 
few failures. This is a critical issue, since the outcome 
of the segmentation will become an input into CAD and 
mammogram registration methods. A disadvantage of 
the proposed method is that every image must be seg- 
mented several times before the best segmentation can 
be selected. 

It is expected that for some applications the ob- 
tained approximate contours will be subject of further 
processing using one of the numerous region-merging 
algorithms available in literature (e.g. [13] or [5], [1]) 
and active contour models (e.g. [6], [16]) to futher im- 
prove their accuracy before being used for detection 






(c) (d) 





(e) 



(f) 



Figure 6. Comparison of U values for the lo- 
cal set. (a), (c), (e) For union of contours 
for the anisotropic, gaussian and the neu- 
trosophic filter, respectively, (b), (d), (f) For 
intersection of contours for the anisotropic, 
gaussian and the neutrosophic filter, respec- 
tively. For each image index, the vertical line 
segment connects the worst and the optimal 
ratio values. 



algorithms. 
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